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CRYSTALU2A.BLE COMPOSITIONS COMPRISING A HEPATITIS C VIRUS NS3 PROTEASE 
DOMAIN/NS4A COMPLEX AND CRYSTALS THEREBY OBTAINED 

5 TECHNICAL FIELD OF INVENTION 

The present invention relates to compositions 
and crystals of a hepatitis C virus protease in complex 
with its viral cof actor. This invention also relates 
10 to methods of using the structure coordinates of 

hepatitis C virus protease in complex with a synthetic 
NS4A to solve the structure of similar or homologous 
proteins or protein complexes. 

15 BACKGROUND OF THE INVENTION 

Infection by hepatitis C virus (HCV) is a 
compelling human medical problem. HCV is recognized as 
the causative agent for most cases of non-A, non-B 

20 hepatitis, with an estimated human seroprevalence of 1% 
globally [Choo, Q.-L. et al., "Isolation of a cDNA 
Clone Derived From a Blood-Borne Non-A # Non-B Viral 
Hepatitis Genome", Science , 244, pp. 359-362 (1989); 
Kuo, G. et al«, "An Assay for Circulating Antibodi s to 

25 a Major Etiologic Virus of Human Non-A, Non-B 

Hepatitis", Science , 244, pp. 362-364 (1989); Purcell, 
R.H., "Hepatitis C virus: Historical persp ctiv and 
current concepts", FEMS Microbiology Reviews , 14, pp. 
181-192 (1994); Van der Poel, C.L., ^Hepatitis C Virus. 

30 Epidemiology, Transmission and Prevent! n in H patitis 
C virus. Current Studies in Hematology and Blood 
Transfusion, H.W. Reesink, Ed., (Bas 1: Karger), pp. 
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infected in the United States alone [Alter , M.J, and 
Mast, E.E., *The Epidemiology of Viral Hepatitis in 
the United States, Gastroenterol, Clin. North Am. , 23 f 
pp. 437-455 (1994)] . 
5 Upon first exposure to HCV only about 20% of 

infected individuals develop acute clinical hepatitis 
while others appear to resolve the infection 
spontaneously. In most instances, however, the virus 
establishes a chronic infection that persists for 

10 decades (Iwarson, S. v The Natural Course of Chronic 

Hepatitis"/ FEMS Microbiology Reviews / 14/ pp. 201-204 
(1994)]. This usually results in recurrent and 
progressively worsening liver inflammation/ which often 
leads to more severe disease states such as cirrhosis 

15 and hepatocellular carcinoma [Kew, M.C., ^Hepatitis C 
and Hepatocellular Carcinoma"/ FEMS Microbiology 
Reviews / 14/ pp. 211-220 (1994); SaitO/ I., et al. 
"Hepatitis C Virus Infection is Associated with the 
Development of Hepatocellular Carcinoma"/ Proc. Natl. 

20 Acad. Sci. USA 87/ pp. 6547-6549 (1990)]. Currently, 
there are no broadly effective treatments for the 
debilitating progression of chronic HCV. 

The HCV genome encodes a polyprotein of 3010- 
3033 amino acids (Figure 1) [Choo f Q.-L./ et al.. 

25 "Genetic Organization and Diversity of the Hepatitis C 
Virus"/ Proc. Natl. Acad. Sci. USA , 88/ pp. 2451-2455 
(1991); Kato, N. et al., Molecular Cloning of the Human 
Hepatitis C Virus Genome From Japanese Patients with 
Non-A/ Non-B Hepatitis"/ Proc. Natl. Acad. Sci. USA / 

30 87, pp. 9524-9528 (1990); Takamizawa/ A. t al., 

"Structure and Organization of the H patitis C Virus 
Genome Isolat d From Human Carri rs"/ J. Virol. , 65/ 
pp. 1105-1113 (1991)]. The HCV nonstructural (NS) 
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proteins provide catalytic machinery for viral 
replication. The NS proteins are derived by 
proteolytic cleavage of the polyprotein 
[Bartenschlager, R. et al. r ^Nonstructural Protein 3 of 
5 the Hepatitis C Virus Encodes a Serine-Type Proteinase 
Required for Cleavage at the NS3/4 and NS4/5 
Junctions", J. Virol . , 67, pp. 3835-3844 (1993); 
Grakoui, A. et al. ""Characterization of the Hepatitis C 
Virus-Encoded Serine Proteinase: Determination of 

10 Proteinase-Dependent Polyprotein Cleavage Sites", J^ 
Virol. , 67, pp. 2832-2843 (1993); Grakoui, A. et al., 
Expression and Identification of Hepatitis C Virus 
Polyprotein Cleavage Products", J. Virol. , 67, pp. 
1385-1395 (1993); Tomei, L. et al. f *NS3 is a serine 

15 protease required for processing of hepatitis C virus 
polyprotein", J. Virol. , 67, pp. 4017-4026 (1993)]. 

The HCV NS protein 3 (NS3) contains a serin 
protease activity that helps process the majority of 
the virai enzymes, and is thus considered essential for 

20 viral replication and infectivity. It is known that 
mutations in the yellow fever virus NS3 protease 
decreases viral infectivity [Chambers, T.J. et. al., 
"Evidence that the N-terminal Domain of Nonstructural 
Protein NS3 From Yellow Fever Virus is a Serine 

25 Protease Responsible for Site-Specific Cleavages in the 
Viral Polyprotein", Proc. Natl. Acad. Sci. USA , 87, pp. 
8898-8902 (1990) ] . The first 181 amino acids of NS3 
(residues 1027-1207 of the viral polyprotein) have been 
shown to contain th serine proteas domain of NS3 that 

30 processes all four downstr am sites of the HCV 

polyprotein (Figure 1) [C. Lin et al., "Hepatitis C 
Virus NS3 Serine Proteinas : Trans-Cl avage 
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Requirements and Processing Kinetics", J. Virol. , 68, 
pp. 8147-8157 (1994)1. 

NS3 is associated with a cofactor, NS4A. 
NS4A seems critical to the activity of NS3, enhancing 
5 the proteolytic efficiency of NS3 at all of the 
cleavage sites. NS4A is a 54 residue amphipathic 
peptide, with a hydrophobic N-terminus and a 
hydrophilic C-terminus [Failla, C. et al., *Both NS3 
and NS4A are Required for Proteolytic Processing of 

10 Hepatitis C Virus Nonstructural Proteins", J. Virol. , 
68, pp. 3753-3760 (1994)]. Its function appears 
complex, possibly assisting in the membrane- 
localization of NS3 and other viral replicase 
components [Lin, C. et al. tt A Central Region in the 

15 Hepatitis C Virus NS4A Protein Allows Formation of an 
Active NS3-NS4A Serine Proteinase Complex In Vivo and 
In Vitro ", J. Virol. , 69, pp. 4373-4380 (1995b); 
Shimizu, Y. et al., "'Identification of the Sequence on 
NS4A Required for Enhanced Cleavage of the NS5A/5B Site 

20 by Hepatitis C Virus NS3 Protease", J. Virol. , 70, pp. 
127-132 (1996); Tanji, Y. et al., "Hepatitis C Virus- 
Encoded Nonstructural Protein NS4A has Versatile 
Functions in Viral Protein Processing", J. Virol. , 69, 
pp. 1575-1581 (1995)) but its best characterized 

25 function is that of a cofactor for the NS3 protease. 

The current understanding of HCV has not led 
to satisfactory treatments for HCV infection. The 
prospects for eff ctive anti-HCV vaccines remain 
unc rtain. The only established therapy for HCV 

30 disease is interferon tr atment. However, interf rons 
have significant sid ff cts tJanssen, H. L. A., et 
al. "Suicide Associated with Alfa-Interferon Therapy 
for Chronic Viral H patitis", J. Hepatol. . 21, pp. 241- 
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243 (1994)]; Renault, P.F. and Hoof nagle, J.H., "Side 
effects of alpha interferon. Seminars in Liver Disease 
9, 273-277, (1989)] and induce long term remission in 
only a fraction (- 25%) of cases [Weiland, 0. 
5 * Interferon Therapy in Chronic Hepatitis C Virus 
Infection", FEM5 Microbiol. Rev, , 14, pp. 279-288 
(1994)]. Thus, there is a need for more effective 
anti-HCV therapies. 

The NS3 protease is considered a potential 

10 target for antiviral agents. However, drug discovery 
efforts directed towards the NS3 protein have been 
hampered by the lack of structural information about 
NS3 and its complex with NS4A. Such structural 
information would provide valuable information in 

15 discovery of HCV NS3 protease inhibitors. However, 

efforts to determine the structure of HCV NS3 protease 
have been hampered by difficulties in obtaining 
sufficient quantities of pure active enzyme 
[Steinkuhler, C. et al., "In Vitro Activity of 

20 Hepatitis C Virus Protease NS3 Purified from 

Recombinant Baculovirus-Infected Sf9 Cells", J. Biol 
Chem. , pp. 637-6273 (1996)]. There have been no 
crystals reported of any NS3 or NS3 protease domain 
protein. Thus, x-ray crystallographic analysis of such 

25 proteins has not been possible. 

SUMMARY OF THE INVENTION 

Applicants have solved this problem by 
30 providing, for the first tim , compositions comprising 
a hepatitis C virus (HCV) NS3 protease-like polypeptide 
complex d with a NS4A-like peptide and methods for 
making such compositions. 
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The invention also provides crystals of a HCV 
NS3 protease-like polypeptide/NS4A-like peptide complex 
and methods for making such crystals. 

The invention also provides the structure 
5 coordinates of a HCV NS3 protease- like 
polypeptide/NS4A-like peptide complex. 

The invention also provides a method for 
determining at least a portion of the three-dimensional 
structure of molecules or molecular complexes which 
10 contain at least some structurally similar features to 
a HCV NS3 serine protease domain. 

BRIEF DESCRIPTION OF THE FIGURES 

15 Figure 1 depicts HCV polyprotein processing. 

The locations of the HCV structural and nonstructural 
proteins are marked on a diagram of the 3011 amino acid 
polypeptide. Cleavages between the structural proteins 
by cellular signal peptidases are marked by asterisks. 

20 Cleavage between NS2 and NS3 is mediated by the NS2/NS3 
metallo-protease. The NS3 serine protease is 
responsible for cleavages between NS3 and NS4A, NS4A 
and NS4B, NS4B and NS5A, and NS5A and NS5B. 

Figure 2 depicts stereo ribbon diagrams of 

25 the NS3/NS4A complex. The view is into the active site 
cleft of the enzyme. Side-chains of active site 
residues His-1083, Asp-1107, and Ser-1165, along with 

Zn ++ ligands Cys-1123, Cys-1125, and Cys-1171 are 
displayed in ball-and-stick representation. Zn ++ , its 
30 H20 ligand, and the p- strand formed by NS4A are also 
shown. 
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Figure 3 lists the atomic structure 
coordinates for hepatitis C virus recombinant, 
truncated nonstructural protein 3 (hereafter referred 
to as tNS3) in complex with a synthetic peptide of the 
5 central region of the nonstructural protein 4A 

(hereafter referred to as sNS4A) as derived by X-ray 
diffraction from crystals of that complex (hereafter 
referred to as tNS3/sNS4A) . The preparation of the 
complex is described in Examples 1 and 2. The 
10 following abbreviations are used in Figure 3: 

"Atom type" refers to the element whose 
coordinates have been determined. Elements are defined 
by the first letter in the column except for zinc which 
is defined by the letters *Zn". 
15 "X, Y, Z" crystallographically define the 

atomic position determined for each atom. 

"B* is a thermal factor that measures 
movement of the atom around its atomic center. 

w Occ" is an occupancy factor that refers to 
20 the fraction of the molecules in which each atom 

occupies the position specified by the coordinates. A 
value of "1" indicates that each atom has the same 
conformation, i.e., the same position, in all molecul s 
of the crystal. 

25 Figure 4 shows a diagram of a system used to 

carry out the instructions encoded by the storage 

medium of Figures 5 and 6. 

Figure 5 shows a cross section of a magn tic 

storage medium. 
30 Figure 6 shows a cross s ction of a 

optically-readable data storage medium. 
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DETAILED DESCRIPTION OF THE INVENTION 
The following abbreviations are used 
throughout the application: 



A 


St 


Ala = 


Alanine 


T 




Thr « 


Threonine 


V 




Val = 


Valine 


C 


cs 


Cys «= 


Cysteine 


L 




Leu « 


Leucine 


Y 


ss 


Tyr « 


Tyrosine 


I 


es 


lie « 


Isoleucine 


N 




Asn «= 


Asparagine 


P 


cs 


Pro « 


Proline • 


Q 


S3 


Gin « 


Glutamine 


F 




Phe * 


Phenylalanine 


D 




Asp » 


Aspartic Acid 


W 




Trp = 


Tryptophan 


E 


SS 


Glu « 


Glutamic Acid 


M 




Met = 


Methionine 


K 




Lys » 


Lysine 


G 




Gly = 


Glycine 


R 




Arg - 


Arginine 


S 


as 


Ser * 


Serine 


H 


SS 


His * 


Histidine 



HCV » hepatitis C virus 

Additional definitions are set forth in the 
5 specification where necessary. 

In order that the invention described herein 
may be more fully understood, the following detailed 
description is set forth. 

Applicants have solved the above problems by 
10 providing, for the first time, crystallizable 

compositions comprising a HCV NS3 protease-like 
polypeptide in complex with a NS4A-like peptide* 

Thus, in one embodiment of this invention is 
provided a composition comprising a hepatitis C virus 
15 NS3-lik polypeptide in complex with an NS4A-like 
p ptid • 

The HCV NS3-lik polypeptide portion of the 
complex is any polypeptid which has th serin 
protease activity of the naturally occurring HCV NS3A 
20 protease, p rticularly th ability to cleav the HCV 
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polyprotein. It includes HCV NS3, NS3 protease domain 
polypeptides and NS3 protease domain-like polypeptides. 

As used herein, the terms "HCV NS3" and *NS3" 
refers to the hepatitis C virus nonstructural- 3 protein 
5 as defined in Lin, C. et al., ^Hepatitis C Virus NS3 
Serine Proteinase: Trans-Cleavage Requirements and 
Processing Kinetics", J. Virol, , 68, pp. 8147-8157 
(1994) . 

The term ^NS3 protease domain polypeptide" 

10 refers to a truncated, serine protease portion of NS3 
as defined in [Bartenschlager, R. et al., 
^Nonstructural Protein 3 of the Hepatitis C Virus 
Encodes a Serine-Type Proteinase Required for Cleavage 
at the NS3/4 and NS4/5 Junctions", J. Virol. , 67, pp. 

15 3835-3844 (1993); Grakoui, A. et al. ^Characterization 
of the Hepatitis C Virus-Encoded Serine Proteinase: 
Determination of Proteinase-Dependent Polyprotein 
Cleavage Sites", J. Virol. , 67, pp. 2832-2843 (1993); 
Grakoui, A. et al., Expression and Identification of 

20 Hepatitis C Virus Polyprotein Cleavage Products", J. 
Virol . , 67, pp. 1385-1395 (1993); Tomei, L. et al., 
M NS3 is a serine protease required for processing of 
hepatitis C virus polyprotein", J. Virol. , 67, pp. 
4017-4026 (1993)]. The disclosure of each of these 

25 documents is herein incorporated by reference. 

The term "NS3 protease domain-like 
polypeptides" refers to polypeptides that differ from 
NS3 protease domain polypeptides by having amino acid 
deletions, substitutions, and additions, but which 

30 r tain the serine protease activity of NS3. 

Preferably, the NS3-like polypeptide in the 
compositions of this invention is tNS3, a r combinantly 
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produced hepatitis C virus protease domain protein that 
is prepared as described herein. 

The NS4A-like peptide portion of the 
compositions of this invention is any peptide or 
5 peptide mimetic that is capable of acting as a NS4A 
cof actor for the NS3. These include NS4A, peptide 
fragments thereof and other peptides that differ from 
NS4A by having amino acid deletions, substitutions, and 
additions, while retaining the above-described 
10 activity. 

As used herein the term "NS4A" refers to the 
hepatitis C virus nonstructural protein 4A which acts 
as a cofactor for NS3 protease [Failla, C. et al., 
^Both NS3 and NS4A are Required for Proteolytic 
15 Processing of Hepatitis C Virus Nonstructural Proteins" 
J. Virol, 68, pp. 3753-3760 (1994); Lin, C. et al., 
^Hepatitis C Virus NS3 Serine Proteinase: Trans- 
Cleavage Requirements and Processing Kinetics" J. 
Virol . 68, pp. 8147-8157 (1994b) ] 
20 Preferably, the NS4A-like peptide is sNS4A, 

the synthetic peptide H-KKGSWIVGRIVLSGKPAIIPKK-OH. 
This peptide encompasses the essential NS3 protease 
domain residues of NS4A. 

Both the NS3-like polypeptide and the NS4A- 
25 like peptide may be produced by any well-known method, 
including synthetic methods, such as solid phase, 
liquid phase and combination solid phase/liquid phase 
syntheses; recombinant DMA methods, including cDNA 
cloning, optionally combined with site directed 
30 mutagenesis; and/or purification of the natural 

products, optionally combined with enzymatic cleavage 
methods to produce fragments of naturally occurring NS3 
and NS4A. 
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According to a preferred embodiment/ the 
compositions of this invention are crystallizable. In 
this preferred embodiment all of the preferred choices 
for the NS3-like polypeptide and the NS4A-like peptide 
5 are identical to those indicated above. 

Advantageously, the crystallizable 
composition provided by this invention are amenable to 
x-ray crystallography. Thus, this invention also 
provides the three-dimensional structure of an HCV NS3- 

10 like polypeptide/NS4A-like peptide complex, 

specifically an HCV tNS3/sNS4A complex, at 2.5 A 
resolution. Importantly, this has provided for the 
first time, information about the shape and structure 
of the NS3 protease domain. 

15 The three-dimensional structure of the HCV 

tNS3/sNS4A complex of this invention is defined by a 
set of structure coordinates as set forth in Figure 3. 
The term "structure coordinates" refers to Cartesian 
coordinates derived from mathematical equations related 

20 to the patterns obtained on diffraction of a 

monochromatic beam of X-rays by the atoms (scattering 
centers) of an tNS3/sNS4A complex in crystal form. The 
diffraction data are used to calculate an electron 
density map of the repeating unit of the crystal. The 

25 electron density maps are then used to establish the 
positions of the individual atoms of the tNS3/sNS4A 
enzyme or enzyme complex. 

Those of skill in the art will understand 
that a set of structure coordinates for an enzyme or an 

30 enzyme-compl x or a portion th reof , is a r lative set 
of points that define a shape in three dimensions. 
Thus, it is possibl that an entirely different set of 
coordinates could define a similar or identical shape. 
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Moreover, slight variations in the individual 
coordinates will have little effect on overall shape. 

The variations in coordinates discussed above 
may be generated because of mathematical manipulations 
5 of the structure coordinates. For example/ the 

structure coordinates set forth in Figure 3 could be 
manipulated by crystallographic permutations of the 
structure coordinates, fractionalization of the 
structure coordinates, integer additions or 

10 subtractions to sets of the structure coordinates, 
inversion of the structure coordinates or any 
combination of the above. 

Alternatively, modifications in the crystal 
structure due to mutations, additions, substitutions, 

15 and/or deletions of amino acids, or other changes in 
any of the components that make up the crystal could 
also account for variations in structure coordinates. 
If such variations are within an acceptable standard 
error as compared to the original coordinates, the 

20 resulting three-dimensional shape is considered to be 
the same. 

Various computational analyses are therefore 
necessary to determine whether a molecule or molecular 
complex or a portion thereof is sufficiently similar to 

25 all or parts of the NS3-like polypeptide/NS4A-like 

peptide structure described above as to be considered 
the same. Such analyses may be carried out in current 
software applications, such as the Molecular Similarity 
application of QUANTA (Molecular Simulations Inc., San 

30 Diego, CA) version 4.1, and as described in th 
accompanying User's Guid . 

The Molecular Similarity application permits 
comparisons b twe n different structures, different 
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conformations of the same structure, and different 
parts of the same structure. The procedure used in 
Molecular similarity to compare structures is divided 
into four steps: 1) load the structures to be 
5 compared; 2) define the atom equivalences in these 
structures; 3) perform a fitting operation; and 4) 
analyze the results. 

Each structure is identified by a name. On 
structure is identified as the target (i.e., the fixed 

10 structure); all remaining structures are working 
structures (i.e., moving structures). Since atom 
equivalency within QUANTA is defined by user input, for 
the purpose of this invention we will define equivalent 
atoms as protein backbone atoms (N, Ca, C and 0) for 

15 all conserved residues between the two structures being 
compared. We will also consider only rigid fitting 
operations. 

When a rigid fitting method is used, the 
working structure is translated and rotated to obtain 

20 an optimum fit with the target structure. The fitting 
operation uses an algorithm that computes the optimum 
translation and rotation to be applied to the moving 
structure, such that the root mean square difference of 
the fit over the specified pairs of equivalent atom is 

25 an absolute minimum. This number, given in angstroms, 
is reported by QUANTA. 

For the purpose of this invention, any 
molecule or molecular complex that has a root mean 
square deviation of conserved residue backbone atoms 

30 (N, ca, C, 0) of less than 1.5 A when superimposed on 
the relevant backbone atoms describ d by structur 
coordinates listed in Figure 3 ar consid red 
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identical. More preferably, the root mean square 
deviation is less than 1.0 A. 

The term "root mean square deviation" means 
the square root of the arithmetic mean of the squares 
5 of the deviations from the mean. It is a way to 
express the deviation or variation from a trend or 
object. For purposes of this invention, the "root m an 
square deviation" defines the variation in the backbone 
of a protein or protein complex from the relevant 

10 portion of the backbone of the NS3-like polypeptide 
portion of the complex as defined by the structure 
coordinates described herein. 

Once the structure coordinates of a protein 
crystal have been determined they are useful in solving 

15 the structures of other crystals. 

Thus, in accordance with the present 
invention, the structure coordinates of a NS3-like 
polypeptide/NS4A-like peptide complex, and in 
particular a tNS3/sNS4A complex, and portions thereof 

20 is stored in a machine-readable storage medium. Such 
data may be used for a variety of purposes, such as 
drug discovery and x-ray crystallographic analysis or 
protein crystal. 

Accordingly, in. one embodiment of this 

25 invention is provided a machine-readable data storage 
medium comprising a data storage material encoded with 
the structure coordinates set forth in Figure 3. 

Figure 4 demonstrates one version of these 
embodiments. Syst m 10 includes a computer 11 

30 comprising a central processing unit {"CPU") 20, a 

working memory 22 which may be, e.g, RAM (random-access 
memory) or "core" memory, mass storage memory 24 (such 
as on or mor disk drives or CD-ROM drives), one or 
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more cathode- ray tube ("CRT") display terminals 26, one 
ox more keyboards 28, one or more input lines 30, and 
one or more output lines 40, all of which are 
interconnected by a conventional bidirectional system 
5 bus 50. 

Input hardware 36, coupled to computer 11 by 
input lines 30, may be implemented in a variety of 
ways. Machine-readable data of this invention may be 
inputted via the use of a modem or modems 32 connected 

10 by a telephone line or dedicated data line 34. 

Alternatively or additionally, the input hardware 36 
may comprise CD-ROM drives or disk drives 24. In 
conjunction with display terminal 26, keyboard 28 may 
also be used as an input device. 

15 Output hardware 46, coupled to computer 11 by 

output lines 40, may similarly be implemented by 
conventional devices. By way of example, output 
hardware 46 may include CRT display terminal 26 for 
displaying a graphical representation of a binding 

20 pocket of this invention using a program such as QUANTA 
as described herein. Output hardware might also 
include a printer 42, so that hard copy output may be 
produced, or a disk drive 24, to store system output 
for later use. 

25 In operation, CPU 20 coordinates the use of 

the various input and output devices 36, 46, 
coordinates data accesses from mass storage 24 and 
accesses to and from working memory 22, and determin s 
the sequenc of data processing steps. A numb r of 

30 programs may b used to proc ss the machine-readabl 

data of this invention. Such programs are discussed in 
reference to the computational m thods of drug 
discovery as described herein. Specific refer nc s to 
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components of the hardware system 10 are included as 
appropriate throughout the following description of the 
data storage medium. 

Figure 5 shows a cross section of a magnetic 
5 data storage medium 100 which can be encoded with a 
machine-readable data that can be carried out by a 
system such as system 10 of Figure 4. Medium 100 can 
be a conventional floppy diskette or hard disk, having 
a suitable substrate 101, which may be conventional, 

10 and a suitable coating 102, which may be conventional, 
on one or both sides, containing magnetic domains (not 
visible) whose polarity or orientation can be altered 
magnetically. Medium 100 may also have an opening (not 
shown) for receiving the spindle of a disk drive or 

15 other data storage device 24. 

The magnetic domains of coating 102 of medium 
100 are polarized or oriented so as to encode in manner 
which may be conventional, machine readable data such 
as that described herein, for execution by a system 

20 such as system 10 of Figure 4* 

Figure 6 shows a cross section of an 
optically-readable data storage medium 110 which also 
can be encoded with such a machine-readable data, or 
set of instructions, which can be carried out by a 

25 system such as system 10 of Figure 4. Medium 110 can 
be a conventional compact disk read only memory 
(CD-ROM) or a rewritable medium such as a 
magneto-optical disk which is optically readabl and 
magneto-optically writabl . Medium 100 preferably has 

30 a suitable substrat 111, which may be conventional, 

and a suitabl coating 112, which may be conventional, 
usually of one side of substrate 111. 
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In the case of CD-ROM, as is well known, 
coating 112 is reflective and is impressed with a 
plurality of pits 113 to encode the machine-readable 
data* The arrangement of pits is read by reflecting 
5 laser light off the surface of coating 112. A 
protective coating 114 , which preferably is 
substantially transparent, is provided on top of 
coating 112 . 

In the case of a magneto-optical disk, as is 

10 well known, coating 112 has no pits 113, but has a 
plurality of magnetic domains whose polarity or 
orientation can be changed magnetically when heated 
above a certain temperature, as by a laser (not shown). 
The orientation of the domains can be read by measuring 

15 the polarization of laser light reflected from coating 
112. The arrangement of the domains encodes the data 
as described above. 

For the first time, the present invention 
permits the use of structure-based or rational drug 

20 design techniques to design, select, and synthesize 

chemical entities, including inhibitory compounds that 
are capable of binding to HCV NS3, NS4A, NS3/NS4A 
complex, or any portion thereof. 

One particularly useful drug design technique 

25 enabled by this invention is iterative drug design. 
Iterative drug design is a method for optimizing 
associations between a protein and a compound by 
determining and evaluating the thr e-dimensional 
structur s of successive sets of protein/ compound 

30 complexes. 

Thos of skill in th art will realize that 
association of natural ligands or substrates with the 
binding pockets of their corr sponding r ceptors or 
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enzymes is the basis of many biological mechanisms of 
action. The term ^binding pocket", as used herein/ 
refers to a region of a molecule or molecular complex, 
that/ as a result of its shape, favorably associates 
5 with another chemical entity or compound* Similarly, 
many drugs exert their biological effects through 
association with the binding pockets of receptors and 
enzymes. Such associations may occur with all or any 
parts of the binding pockets. An understanding of such 

10 associations will help lead to the design of drugs 

having more favorable associations with their target 
receptor or enzyme, and thus, improved biological 
effects. Therefore, this information is valuable in 
designing potential ligands or inhibitors of receptors 

15 or enzymes, such as inhibitors of HCV NS3-like 
polypeptides, and more importantly HCV NS3. 

The term ^associating with" refers to a 
condition of proximity between chemical entities or 
compounds, or portions thereof. The association may b 

20 non-covalent — wherein the juxtaposition is 

energetically favored by hydrogen bonding or van der 
Waals or electrostatic interactions — or it may be 
covalent. 

In iterative drug design, crystals of a 
25 series of protein/compound complexes are obtained and 
then the three-dimensional structures of each complex 
is solved. Such an approach provides insight into the 
association between the proteins and compounds of each 
compl x. This is accomplished by selecting compounds 
30 with inhibitory activity, obtaining crystals of this 
new prot in/ compound complex, solving the three- 
dimensional structure of the complex, and comparing the 
associations between the new protein/compound complex 
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and previously solved protein/compound complexes. By 
observing how changes in the compound affected the 
protein/compound associations, these associations may 
be optimized. 

5 In some cases, iterative drug design is 

carried out by forming successive protein-compound 
complexes and then crystallizing each new complex. 
Alternatively, a pre-formed protein crystal is soaked 
in the presence of an inhibitor, thereby forming a 

10 protein/ compound complex and obviating the need to 

crystallize each individual protein/ compound complex. 
Advantageously, the HCV NS3-like polypeptide/NS4A-like 
peptide crystals, and in particular the tNS3/sNS4A 
crystals, provided by this invention may be soaked in 

15 the presence of a compound or compounds, such as NS3 
protease inhibitors, to provide NS3-like 
polypeptide/NS4A-like peptide /compound crystal 
complexes* 

As used herein, the term "soaked" refers to a 
20 process in which the crystal is transferred to a 

solution containing the compound of interest. 

In another embodiment of this invention is 

provided a method for preparing a composition 

comprising a NS3-like polypeptide protein comprising 
25 the steps described in Examples 1 and 2. Preferably, 

the composition comprises a NS3-like polypeptide in 

complex with a NS4A-like peptide. 

The structure coordinates set forth in Figur 

3 can also be used to aid in obtaining structural 
30 information about another crystallized molecule or 

molecular complex. This may be achieved by any of a 

number of well-known techniques, including mol cular 

replacement. 
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The structure coordinates set forth in Figure 
3 can also be used for determining at least a portion 
of the three-dimensional structure of molecules or 
molecular complexes which contain at least some 
5 structurally similar features to HCV NS3. In 

particular, structural information about another 
crystallized molecule or molecular complex may be 
obtained. This may be achieved by any of a number of 
well-known techniques, including molecular replacement. 
10 Therefore, in another embodiment this 

invention provides a method of utilizing molecular 
replacement to obtain structural information about a 
crystallized molecule or molecular complex whose 
structure is unknown comprising the steps of: 

15 a) generating an X-ray diffraction pattern 

from said crystallized molecule or molecular complex; 
and 

b) applying at least a portion of the 
structure coordinates set forth in Figure 3 to the 

20 X-ray diffraction pattern to generate a 

three-dimensional electron density map of the molecule 
or molecular complex whose structure is unknown. 

Preferably, the crystallized molecule or 
molecular complex comprises a NS3-like polypeptide and 

25 a NS4A-like peptide. More preferably, the crystallized 
molecule or molecular complex is obtained by soaking a 
crystal of this invention in a solution. 

By using molecular replacement, all or part 
of the structure coordinates of the tNS3/sNS4A complex 

30 provided by this invention (and set forth in Figure 3) 
can be used to determine th structure of a 
crystallized molecule or molecular complex whose 
structur is unknown more quickly and fficiently than 
attempting to determin such information ab Initio. 
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Molecular replacement provides an accurate 
estimation of the phases for an unknown structure. 
Phases are a factor in equations used to solve crystal 
structures that can not be determined directly.* 
5 Obtaining accurate values for the phases, by methods 
other than molecular replacement, is a time-consuming 
process that involves iterative cycles of 
approximations and refinements and greatly hinders th 
solution of crystal structures* However, when the 

10 crystal structure of a protein containing at least a 

homologous portion has been solved, the phases from the 
known structure provide a satisfactory estimate of the 
phases for the unknown structure. 

Thus, this method involves generating a 

15 preliminary model of a molecule or molecular complex 
whose structure coordinates are unknown, by orienting 
and positioning the relevant portion of the tNS3/sNS4A 
complex according to Figure 3 within the unit cell of 
the crystal of the unknown molecule or molecular 

20 complex so as best to account for the observed X-ray 
diffraction pattern of the crystal of the molecule or 
molecular complex whose structure is unknown. Phases 
can then be calculated from this model and combined 
with the observed X-ray diffraction pattern amplitudes 

25 to generate an electron density map of the structure 
whose coordinates are unknown. This, in turn, can b 
subjected to any well-known model building and 
structure refinement techniques to provide a final, 
accurate structure of the unknown crystallized molecule 

30 or molecular complex [E. Lattman, "Us of the Rotation 
and Translation Functions", in Meth. Enzymol ., 115, pp. 
55-77 (1985); M. G. Rossmann, ed., "The Molecular 
Replacement M thod", Int. Sci. Rev. Ser. , No. 13, 
Gordon & Breach, New York (1972)]. 
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The structure of any portion of any 
crystallized molecule or molecular complex that is 
sufficiently homologous to any portion of the 
tNS3/sNS4A complex can be solved by this method. 
5 In a preferred embodiment, the method of 

molecular replacement is utilized to obtain structural 
information about a molecule or molecular complex, 
wherein the complex comprises a NS3~like polypeptide. 
Preferably the NS3-like polypeptide is tNS3 or 

10 homologue thereof* 

The structure coordinates of tNS3/sNS4A as 
provided by this invention are particularly useful in 
solving the structure of other crystal forms of NS3- 
like polypeptide, preferably other crystal forms of 

15 tNS3; NS3-like polypeptide/NS4A-like peptide, 

preferably tNS3/sNS4A; or complexes comprising any of 
the above* 

The structure coordinates are also 
particularly useful to solve the structure of crystals 

20 of NS3-like polypeptide/NS4A-like peptide complexes, 

particularly tNS3/sNS4A, co-complexed with a variety of 
chemical entities. This approach enables the 
determination of the optimal sites for interaction 
between chemical entities, including interaction of 

25 candidate NS3 inhibitors with NS3 or the NS3/NS4A 
complex. For example, high resolution X-ray 
diffraction data collected from crystals exposed to 
different types of solvent allows the determination of 
where each type of solvent molecul resides. Small 

30 molecules that bind tightly to those sites can then be 
designed and synthesized and tested for their NS3 
inhibition activity. 
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All of the complexes referred to above may be 
studied using well-known X-ray diffraction techniques 
and may be refined versus 1.5-3 A resolution X-ray data 
to an R value of about 0.20 or less using computer 
5 software, such as X-PLOR [Yale University, ©1992, 

distributed by Molecular Simulations, Inc.; see, e.g., 
Blundell & Johnson, supra ; Meth. Enzymol ., vol. 114 & 
115, H. w. Wyckoff et al., eds., Academic Press 
(1985)]. This information may thus be used to optimize 

10 known NS3 inhibitors, and more importantly, to design 
new NS3 inhibitors. 

In order that this invention be more fully 
understood, the following examples are set forth. 
These examples are for the illustrative purposes only 

15 and are not to be construed as limiting the scope of 
this invention in any way. 

EXAMPLE 1 

Expression and Purification of tNS3 
20 The truncated NS3 serine protease domain 

(tNS3) was cloned from a cDNA of the hepatitis C virus 
H. strain [Grakoui, A. et al., ^Expression and 
Identification of Hepatitis C Virus Polyprotein 
Cleavage Products", J. Virol. , 67, pp. 1385-1395 
25 (1993)]. The first 181 amino acids of NS3 (residues 

1027-1207 of the viral polyprotein) have been shown to 
contain the serine protease domain of NS3 that 
processes all four downstream sit s of the HCV 
polyprotein [Lin, C, et al., Hepatitis C Virus NS3 
30 Serine Proteinase: Trans-Cleavage Requirem nts and 
Processing Kinetics", J. Virol. 68, pp. 8147-8157 
(1994b)], so we expressed a (His) 6~fusion prot in bas d 
on this tNS3. The plasmid pET-BS(+) /HCV/T7-NS31 pi -His 
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was derived from pTM3/HCV/1027-1207 (NS3iei) (Id.), by 
using polymerase chain reaction to introduce epitope 
tags and new restriction sites • A T7-tag (ASMTGGQQMG) , 
from the N-terminus of the gene 10 protein of the T7 
5 bacteriophage 

[Tsai, D.E. et al., M Jn Vitro Selection of an RNA 
Epitope Immunologically Cross-Reactive With a Peptide"/ 
Proc. Natl. Acad. Sci. USA , 89, pp. 8864-8868 (1992)], 
was placed at the N-terminus of the tNS3 domain. Two 
10 linker residues (GS) were placed at the tNS3 C- 
terminus, followed by the (His)g-tag. E.coli 

JM109(DE3) cells, freshly transformed with the pET- 
BS(+)/HCV/T7-NS3i8l-His plasmid, were grown at 37 °C in 
complex media supplement with 100 pg/ml ampicillin, in 
15 a 10 L fermentor (Braun) . When the cell density 

reached an OD600 of 3-4 the temperature of the culture 

was rapidly reduced to 30 °C, and induction was 
immediately initiated by the addition of 1 mM IPTG. 
Cells were harvested at 2 h post-induction, and flash 

20 frozen at -70 °C prior to purification. 

The tNS3 was purified from the soluble 
fraction of the recombinant E.coli lysates as follows, 
with all procedures being performed at 4 °C unless 
stated otherwise. Cell paste (75-100g) was resuspended 

25 in 15 volumes of 50 mM HEPES, 0.3 M NaCl, 10% glycerol, 
0.1% B-octyl glucoside, 2 mM B-mercaptoethanol, pH 8.0. 
Cells were ruptur d using a microfluidi2er and th 
homogenate was clarified by centrifugation at 100,000 x 
g for 30 min. The supernatant was brought to 50 mM 

30 HEPES, 20 mM imidazole, 0.3 M NaCl, 27.5% glycerol, 
0.1% fl-octyl- glucoside, 2 mM fi-mercapto thanol, pH 
8.0, and appli d at 1.0 ml/min to a 7.0 ml Ni-Agarose 
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affinity column, equilibrated in the same buffer. 
After loading, the column was washed with 10-15 volumes 
of equilibration buffer and the bound proteins were 
eluted with equilibration buffer containing 0.35 M 
5 imidazole. The protein was then size-fractionated on 
two columns in series (each 2.6 cm x 90 cm) packed with 
Pharmacia high resolution SI 00 resin and equilibrated 
with 25 mM HEPES, 0.3M NaCl, 10% glycerol, 0.1% fl- 
octylglucoside, 2 mM B-mercaptoethanol, pH 8.0. The 

10 tNS3 fractions, identified by SDS-PAGE, were pooled and 
concentrated to 1 mg/ml using a Ami con Centriprep-10, 
and stored at -70°C. The tNS3 was thawed slowly on ice 
and the NS4A peptide (dissolved in the size-exclusion 
chromatography buffer) was added at a tNS3:NS4A-peptide 

15 molar ratio of 1:2. The sample was then diluted 2.5- 
fold with 15 mM MES, 0.5 M NaCl, 20 mM fi- 
mercaptoethanol, pH6.5, and concentrated to ~2 ml (~ 2 
mg/ml) by ultrafiltration. The sample was then dilut d 
2-fold with the pH 6.5 buffer and concentrated again to 

20 -2 ml. This dilution process was repeated until it 
gave a >40-fold dilution of the original buffer 
constituents. The protein sample was then concentrated 
to 13.0 mg/ml and centrifuged at ^300,000 x g for 20 

min at 4 °C. Concentrations of the pure tNS3 and 
25 tNS3/4A complex were determined by OV absorption 

spectroscopy, using a molar absorption coefficient 
(A 2 80> of 17,700 M-l-cm-1. 

EXAMPLE 2 

30 4A Peptide Synthesis and Purification 

Th HCV NS4A p ptide was synthesized to span 
residues Gly21 to Pro39 of the viral cofactor (residues 
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1678 to 1696 of the HCV polyprotein) , which 
incorporates the essential region reported to be 
essential for NS3 stimulation [Lin, C. et al. "A 
Central Region in the Hepatitis C Virus NS4A Protein 
5 Allows Formation of an Active NS3-NS4A Serine 

Proteinase Complex In Vivo and In Vitro ", J, Virol « 69 , 
pp. 4373-4380 (1995)]. Lysine residues were added to 
the termini to assist aqueous solubility, and a serin 
residue was substituted for Cys22 (residue 1679 of the 
10 polyprotein of the HCV H strain) . The peptide (H- 

KKGSWIVGRIVLSGKPAIIPKK-OH • TFA salt) was prepared by 
the solid-phase peptide synthesis (Applied Biosystems 
433A) beginning with N a -Fmoc, N e -Boc-Lys Wang resin. 

N a -ftaoc-protected amino acids were added sequentially 
15 using HBTU (2-(lH-benzotriazol-l-yl)l,l,3,3- 

tetramethyluronium hexafluorophosphate) with HOBt (1- 
hydroxybenzotriazole hydrate) as coupling agents in N- 
methylpyrrolidinone. Cleavage from the resin and 
global deprotection were accomplished with 95% 
20 trifluoroacetic acid and 5% water at room temperature 
for 1.5 hr (15 ml/ g resin). The peptide was purified 
by preparative HPLC on a Waters Delta Pak CI 8, 15 \m, 
30CA column (30 mm x 300 mm) eluting with a linear 
gradient of acetonitrile (15-40%) in 0,1% aqueous 
25 trifluoroacetic acid over 35 min (flow rate of 22 

ml/min) . Peptide purity was confirmed by analytical 
HPLC. The sequence was confirmed by direct N-terminal 
sequence analysis and matrix-assisted laser, de sorption 
mass spectrometry (Kratos MALDI I), which showed th 

30 corr ct (M + H) + and (M + Na) + molecular ions. 
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EXAMPLE 3 
Crystallization and Data Collection 
Crystals of the tNS3/NS4A complex were grown 
by hanging-drop vapor diffusion over a reservoir of 0.1 
5 M MES, 1«B M NaCl, 0.1 M sodium/potassium phosphate, 10 
mM B-mercaptoethanol, pH 6.5, The crystals grew over 
the course of 2-3 weeks, to final dimensions of about 
0.1 x 0.1 x 0.25 mm. The rhorabohedral crystals used in 
this study belonged to space group R32, with unit cell 

10 dimensions a=b=225.0A, and c«=75.5A, and contained two 
tNS3/NS4A complexes per asymmetric unit. 

Statistics for data collection, heavy atom 
refinement, and crystallographic refinement are given 
in Table 1. All heavy atom soaks were done in hanging- 

15 drops over the same reservoir as used for 

crystallization. Crystals were transferred to a 
stabilizing solution (50 mM MES, 2.0 M NaCl, 0.1 M 
sodium/potassium phosphate, 10 mM B-mercaptoethanol, 
and 20% glycerol, pH 6.2) and then frozen in a dry 

20 nitrogen gas stream at 100 K (Molecular Structure 
Corp., Houston, TX) for data collection. Data was 
acquired by oscillation photography on a Rigaku R-AXIS 
I1C phosphor imaging area detector mounted on a Rigaku 
RU200 rotating anode generator (MSG) , operating at 50kV 

25 and 100mA. Measured intensities were integrated, 

scaled, and merged using the HKL software package (Z. 
Otwinowski and W. Minor) . 

EXAMPLE 4 

30 Phasing, Model Building and Refinement 

Heavy atom positions were located by 
insp ction and confirmed with difference Fourier 
synth ses. H avy atom param t rs were refined and 
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phases computed to 3. lA using the program PHASES 
[Furey, W. and Swaminathan, S. "PHASES-95: a program 
package for the processing and analysis of diffraction 
data from macromolecules", Meth. En2ymol. , (1996) . MIR 
5 phases were improved and extended to 2.7A by cycles of 
solvent flattening [Wang, B.C., ^Resolution of Phase 
Ambiguity in Macromolecular Crystallography", Methods 
in Enzymol. 115, pp. 90-112 (1985)] combined with 
histogram matching (Zhang, K.Y.J, and Main, P., "The 

10 Use of Sayre's Equation With Solvent Flattening and 

Histogram Matching for Phase Extension and Refinement 
of Protein Structures", Acta Crystallogr. , A46, pp. 
377-381 (1990) ] using the CCP4 crystallographic package 
(Collaborative Computation Project, 1994). The 

15 resulting electron density map displayed nearly 

continuous density for the protein backbone as well as 
strong side chain density. Approximately 80% of the 
model could be unambiguously built into this map 
(QUANTA 4.1, Molecular Simulations), and a single round 

20 of simulated annealing refinement in X-PLOR [Brunger, 
A. T., *X-PLOR: A System for X-Ray Crystallography and 
NMR", New Kaven, Connecticut: Department of Molecular 
Biophysics and Biochemistry, Yale University (1993)] 
brought the R-factor to 29% and free R value to 33% 

25 [Brunger, A. T., "Free R Value: A Novel Statistical 
Quantity for Assessing the Accuracy of Crystal 
Structures'', Nature , 355, pp. 472-475 (1992)]. The 
remainder of the model was built and refined in several 
steps, by first extending the resolution to 2.5A and 

30 then adding well-ordered water mol cules. A final 

round of positional and individual temperature factor 
refinement brought the R-factor to 21.6% (free R value 
26.1%) for 26,652 refl ctions betw en 6.0 and 2.5A 
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(F>lsF) . The current model consisted of tNS3 residues 
1055-1206 and NS4A residues 1678-1693 in complex A, and 
tNS3 residues 1028-1206 and NS4A residues 1678-1696 for 
complex B (polyprotein numbering/ with 2 zinc atoms 
5 and 130 water molecules. A Ramachandran plot for the 
final model contained 91% of the residues in the most 
favored regions and 0% in disallowed or generously- 
allowed regions. The rms deviations from ideality were 
0.007A for bond lengths and 1.47° for bond angles. 

10 While we have described a number of 

embodiments of this invention, it is apparent that our 
basic examples may be altered to provide other 
embodiments which utilize the products and processes of 
this invention. Therefore, it will be appreciated that 

15 the scope of this invention is to be defined by the 

appended claims rather than by the specific embodiments 
which have been represented by way of example . 
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CLAIMS 

He claim: 

1. A composition comprising a HCV NS3-like 
5 polypeptide complexed with a NS4A-like peptide. 



2. The composition according to claim 1, 
wherein the HCV NS3-like polypeptide is a NS3 protease 
domain polypeptide or a NS3 protease domain-like 

10 polypeptide. 

3. The composition according to claim 1, 
wherein the HCV NS3-like polypeptide is tNS3. 

15 4. The composition according to any one of 

claims 1 to 3, wherein the NS4A-like peptide is H- 
KKGSWIVGRIVLSGKPAIIPKK-OH. 



5, A crystal comprising a HCV NS3-like 
20 polypeptide complexed with a NS4A-like peptide. 

6, The crystal according to claim 5, wher in 
the HCV NS3-like polypeptide is a NS3 protease domain 
polypeptide or a NS3 protease domain-like polypeptid . 

25 

7, The crystal according to claim 5, wher in 
the HCV NS3-like polypeptide is tNS3. 



30 



8. The crystal according to any one of 
claims 1 to 3, wherein the NS4A-like peptide is H- 
KKGSWIVGRIVLSGKPAIIPKK-OH . 
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9. The crystal according to claim 5, 
additionally comprising an inhibitor of HVC NS3. 

10. A machine* readable data storage medium, 

5 comprising a data storage material encoded with machine 
readable data, wherein the data is defined by the 
structure coordinates of a tNS3/sNS4A complex according 
to Figure 3, or a homologue of said complex, wherein 
said homologue comprises backbone atoms that have a 
10 root mean square deviation from the backbone atoms of 
the complex of not more than 1.5A 

11* The machine-readable data storage 
medium, according to claim 10, wherein said molecule or 

15 molecular complex is definfed by the set of structure 

coordinates for tNS3/sNS4A according to Figure 3, or a 
homologue of said molecule or molecular complex, said 
homologue having a root mean square deviation from th 
backbone atoms of said amino acids of not more than 1,5 

20 A. 

12. A machine- readable data storage medium 
comprising a data storage material encoded with a first 
set of machine readable data comprising a Fourier 

25 transform of at least a portion of the structural 
coordinates for tNS3/sNS4A according to Figure 3; 
which, when combined with a second set of machine 
readable data comprising an X-ray diffraction pattern 
of a molecul or molecular complex of unknown 

30 structure, using a machine programmed with instructions 
for using said first set of data and said second set of 
data, can det rmine at least a portion of the structure 
coordinates corresponding to the second set of machine 
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readable data, said first set of data and said second 
set of data* 

13. A method of obtaining structural 

5 information about a molecule or a molecular complex of 
unknown structure by using the structure coordinates 
set forth in Figure 3, comprising the steps of: 

a. generating X-ray diffraction data from 
said crystallized molecule or molecular complex; 
10 b. applying at least a portion of the 

structure coordinates set forth in Figure 3 to said 
X-ray diffraction pattern to generate a 
three-dimensional electron density map of at least a 
portion of the molecule or molecular complex. 

15 

14. The method according to claim 13, 
wherein the molecule or molecular complex of unknown 
structure comprises a polypeptide selected from a MS3- 
like polypeptide in complex with a NS4A-like peptide. 
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FIGURE 3 
tNS3 COORDINATES (Complex A) 
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WW 


GLU 

wLw 


1909 

Ifewfe 


79 090 


A7 *7A 
4/ .0/4 


A A ion 
04. IwO 


4 AA 

1.00 


A7 AA 

37.02 


ATOM 1086 

#i 1 w'lvl I www 


CD 

WW 


GLU 


1909 

Ifewfe 


71 147 


AP RA7 
40.04/ 


AO OAT 

Oo.fe4/ 


4 AA 

1.00 


J A A A 

40.33 


ATOM 1087 


OE1 

wb 1 


GLU 

vILU 


1909 

IfeWfe^ 


71 QOO 


AA QQA 
40.004 


ao oce 


4 AA 

1.00 


A A A A 

43.28 


ATOM 1088 


OE9 


GLU 

V3LU 


IfeUfe 


70 17* 


Afl QOA 
40.W&4 


AO A AO 

0O.46Z 


4 AA 
1.00 


>lifl CO 

44.53 


ATOM 1089 


c 

W 


GLU 

ULU 


1902 

I fewfe 


71 98*5 

/ l.feWw 1 


AA 1 99 
40. lfe^£ 


A7 71 0 
0/./ io 


4 AA 
1.00 


o-» A4 

37.01 


ATOM 1090 

#» 1 W/.VI 1 WWW 


o 


GLU 

WLW 


1202 

1 fewfe 


71 Q77 


AA A*Q 
40.4051 


AQ A70 
OO.O/O 


4 AA 
I.UU 


OO CO 
00.00 


ATOM 1091 


N 


THR 


1203 

1 feww 


70 043 


A* 879 


A7 QAC 
O/.OOO 


1 AA 
I.UU 


OA OO 


ATOM 1092 

#* ■ WiWl 1 WWfe 


CA 


THR 
1 1 lit 


1203 

1 feWW 




A* *Q9 


AO 17A 
0*7.1 /4 


1 AA 
I.UU 


oo oo 


ATOM 1093 

r» ■ V^l.l 1 WWW 


CB 


THR 


1203 

1 feww 


67 Q*8 

w/ »wWW 


A* 900 
Hj.lU«7 


AQ OOO 


1 AA 
I.UU 


OO A7 
OO.O/ 


ATOM 1094 

1 1 ■ X^ITI 1 WW f 


0G1 

V/w 1 


THR 


1203 

1 feWW 


67 898 

w/ *OfeO 


A A Q*1 


AA AOA 
00.404 


1 AA 
I.UU 


A A AA 

4^.U£ 


ATOM 1095 


CG2 


THR 


1203 

1 feww 


67 168 

w/ • 1 WW 


46 98A 


AP QAO 
OO.OOO 


1 AA 
I.UU 


OA Q7 
OO.O/ 


ATOM 1096 

#» ■ 1 « 1 1 WWW 


c 


THR 


1203 

1 fe WW 


70 156 

r w« IwW 


44 *48 


70 001 


1 AA 
I.UU 


OO CA 


ATOM 1097 

» » 1 %^IVI 1 WW f 


o 


THR 

• 1 If * 


1203 

1 feww 


70 425 


AA 7ftA 

*Hr./ w*r 


71 1PO 
/ 1 .lOO 


1 AA 
I.UU 


AA AC 
4U.OO 


ATOM 1098 

« • ■ %^IVI ■ WWW 


N 

■ i 


THR 

i i ii 1 


1204 

1 fe W^T 


70 479 

/ w»*t f w 


43 491 


AQ O70 


1 OA 
I.UU 


AA OO 


ATOM 1099 


CA 


THR 


1204 

• fe w^ 


71 180 

* 1 • 1 Ww 


42 328 

tfe.WfeW 


70 0A0 


1 OO 
I.UU 


AA *1 
4U.O! 


ATOM 1100 


CB 


THR 


1204 


71 409 

f 1 •"TWW 


41 146 

f 1 • 1 *tw 


AQ 07* 


1 OO 
I.UU 


OQ QA 
Oa.ww 


ATOM 1101 

» » ■ ^^»WI 1 1 1 


0Q1 


THR 

i • ii t 


1204 

• feW^ 


70 150 

i Wa 1 WW 


40 520 

"tW.wfeW 


88 7 A* 
wO./Ow 


1 OO 
I.UU 


07 70 


ATOM 1102 


CG2 


THR 


1204 


72 353 

# fe*WWW 


40 120 

~w. 1 feW 


69 677 


1 OO 
1 .UU 


A1 91 

4 1 .4& 1 


ATOM 1103 

* » ■ *^l*l ■ 1 W^W/ 


c 


THR 

i i ii ■ 


1204 

1 fe W^T 


72 492 


42 818 

"tfeaO 1 W 


70 AAA 


1 oo 

1 .UU 


A1 OO 
41 .OO 


ATOM 1104 


o 


THR 


1204 

• IiiW 1 


72 932 

* fe* WWfe 


42 399 

*tfe*Wfefe 


71 AAA 


1 oo 

1 .UU 


AO 18 
4L10 


ATOM 1105 


N 


MET 


1205 


73.101 

• w* • w ■ 


43 810 

iW»W 1 w 


70 012 

/ w.W 1 fe 


1 00 

1 tUU 


A9 Q9 

4&*96 


ATOM 1106 


CA 


MET 


1205 


74.338 


44.395 


70.510 


1.00 


44.88 


ATOM 1107 


CB 


MET 


1205 


75.064 


45.136 


69.388 


1.00 


43.43 


ATOM 1108 


CG 


MET 


1205 


75.778 


44.265 


68.388 


1.00 


41.30 


ATOM 1109 


SD 


MET 


1205 


76.649 


45.304 


67.195 


1.00 


38.98 


ATOM 1110 


CE 


MET 


1205 


75.690 


45.025 


65.732 


1.00 


39.73 


ATOM 1111 


C 


MET 


1205 


74.008 


45.401 


71.612 


1.00 


48.30 


ATOM 1112 


0 


MET 


1205 


74.680 


46.434 


71.732 


1.00 


50.45 


ATOM 1113 


N 


ALA 


1206 


72.985 


45.106 


72.415 


1.00 


51.29 
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O ALA 1206 70.842 44.487 74.147 1.00 52.28 



SNS4ACOORDINATES (Complex A) 



Atom 



Type 


Resid £ 


2S 


Y 


z 


OGp 






a. a a A A^t 

ATOM 2427 


N 


GLY 


A A"* A 

1678 


A A AAA. 

82.660 


A A Af A 

20.678 


AM A A 

69.142 


A MA 

1.00 


A"V A A 

47.82 


ATOM 2428 


CA 


GLY 


A A "FA 

1678 


A A AAP 

83.325 


A. ^ A A f 

21.195 


A** A A ^ 

67.901 


1.00 


47.45 


ATOM 2429 


C 


GLY 


A A"WA. 

1678 


A A AAA 

83.444 


A A ••A A 

22.729 


67.863 


1.00 


A A ^ A 

42.58 


ATOM 2430 


O 


GLY 


a a»fa 

1678 


A A AAA 

83.086 


AM A 

23.435 


68.815 


1.00 


46.32 


ATOM 2431 


N 


SER 


1679 


83.951 


23.237 


66.745 


1.00 


39.26 


ATOM 2432 


CA 


SER 


a a— a, 

1679 


A A A A A 

84.188 


24.669 


66.594 


1.00 


35.38 


ATOM 2433 


CB 


SER 


1679 


85.628 


24.981 


67.011 


1.00 


34.24 


ATOM 2434 


OG 


SER 


1679 


85.746 


26.265 


67.587 


1.00 


33.09 


ATOM 2435 


C 


SER 


1679 


A A A A A 

83.965 


25.057 


65.134 


1.00 


32.76 


ATOM 2436 


O 


SER 


1679 


84.084 


24.218 


64.240 


1.00 


34.29 


ATOM 2437 


N 


VAL 


1680 


AA P*A A 

83*594 


26.307 


64.894 


1.00 


29.61 


ATOM 2438 


CA 


VAL 


a a a a 

1680 


A A A A 

83.365 


A A *V A A 

26.784 


63.533 


1.00 


A4 AA 

27.92 


ATOM 2439 


CB 


VAL 


A AMA 

1680 


A A Ai J 

82.341 


MM AAA 

27.936 


A A A A ^ 

63.495 


A A A 

1.00 


A\mm m\a% 

27.03 


* *V^S. A A A\ A A0\ 

ATOM 2440 


CG1 


VAL 


1680 


A 4 A A^ 

81.995 


A A Aj«=7 c 7 

28.277 


ma AAA 

62.069 


A A A 

1.00 


A A A A 

26.39 


A M A A A 

ATOM 2441 


CG2 


VAL 


A AAA 

1680 


A A AAA 

81.099 


p a a 

27.569 


64.277 


A A A 

1.00 


A*9 AA 

27.69 


A A A A A A 

ATOM 2442 


C 


VAL 


A AAA 

1680 


A A AAA 

84.692 


A«V A 4 ^ 

27.31 1 


A A AA^ 

63.007 


A A A 

1.00 


A A A|> 

26.65 


ATOM 2443 


O 


VAL 


1680 


85.349 


28.115 


A A AAA 

63.666 


A AA 

1.00 


A A AA 

28.02 


ATOM 2444 


N 


t * a i 

VAL 


4 A 0 4 

1681 


OF 4 

85.108 


26.844 


61 .842 


4 AA 

1.00 


A 4 OA 

24.80 


ATOM 2445 


A^ A 

CA 


nil 

VAL 


1681 


86.364 


A— » AAA 

27.300 


61278 


4 AA 

1.00 


AO A A 

23.49 


ATOM 2446 


CB 


VAL 


4 CQ4 

1681 


Q*7 004 

87.331 


26.117 


61 .027 


4 nn 
1.00 


00 nc 
23.05 


ATOM 2447 


CG1 


\ 0 At 

VAL 


4 CQ4 


o/.o» 1 


oe qojI 
25.384 


eo 04 >t 
62.314 


4 nn 
1.00 


04 07 
21.97 


ATOM 2448 


CG2 


VAL 


lool 


oo./ol 


oe 4 co 
25.153 


en A4 0 
60.01 0 


4 nn 
1.00 


Oil 04 

24.91 


ATAU Oil Af\ 

ATOM 2449 


c 


If At 

VAL 


lool 


oo.lOo 


OO AC4 

28.061 


en noe 
59 .985 


4 nn 
1.00 


OO 74 

22.71 


ATOM 2450 


A^ 

o 


VAL 


lool 


ob* loo 


0"7 7CO 

27.762 


en OC4 


4 nn 
1.00 


OO OO 

22.32 


ATOM 2451 


A. ■ 

N 


ILE 


4 coo 
loo2 


80.923 


on /xt? 
28.077 


en *7on 
59.729 


4 nn 
1.00 


OO 7fl 

22.79 


ATAII Oil CO 

ATOM 2452 


CA 


ILE 


4 COO 

1682 


oe QAQ 


on ofifi 
29.899 


eo eoo 
58.528 


4 nn 
1.00 


4 a oe 
19.66 


ATOM 2453 


CB 


ILE 


4 COO 

1682 


07 onn 
87.309 


31.332 


eo 04 4 

58.81 1 


4 AA 

1.00 


4 e on 
16.80 


ATOM 2454 


CG2 


ILE 


1682 


87.324 


32.164 


57.540 


1.00 


16.76 


ATOM 2455 


CG1 


ILE 


1682 


86.417 


31.981 


59.875 


1.00 


13.77 


ATOM 2456 


CD1 


ILE 


1682 


86.965 


33.244 


60.432 


1.00 


9.56 


ATOM 2457 


C 


ILE 


1682 


87.627 


29.247 


57.418 


1.00 


19.97 


ATOM 2458 


O 


ILE 


1682 


88.844 


29.115 


57.522 


1.00 


20.81 


ATOM 2459 


N 


VAL 


1683 


86.946 


28.826 


56.361 


1.00 


20.17 


ATOM 2460 


CA 


VAL 


1683 


87.599 


28.157 


55.253 


1.00 


18.67 


ATOM 2461 


CB 


VAL 


1683 


86.827 


26.867 


54.857 


1.00 


20.13 


ATOM 2462 


CG1 


VAL 


1683 


86.595 


25.992 


56.091 


1.00 


19.43 


ATOM 2463 


CG2 


VAL 


1683 


85.511 


27.205 


54.188 


1.00 


21.44 


ATOM 2464 


C 


VAL 


1683 


87.785 


29.039 


54.027 


1.00 


17.87 


ATOM 2465 


O 


VAL 


1683 


88.279 


28.592 


52.993 


1.00 


19.88 


ATOM 2466 


N 


GLY 


1684 


87.400 


30,297 


54.135 


1.00 


16.04 


ATOM 2467 


CA 


GLY 


1684 


87.547 


31.186 


53.005 


1.00 


13.85 
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ATOM 2471 


CA 


ARG 


1685 


86.660 

ww* www 


34 861 

w^r* ww i 


52 660 


1 00 


18 38 


ATOM 2472 


CB 


ARG 


1685 


87.800 

W • *S#WW 


35 715 

WW*) 9 • W 


53^19 

WW*)**b 1 w 


1 00 

1 .WW 


17 53 

1 / *WW 


ATOM 2473 


CG 


ARG 


1685 


88.339 

WwtWWw 


35 249 


54 559 


1 00 

1 .WW 


18 on 


ATOM 2474 


CD 


ARG 


1685 


89 566 

ww* Www 


36 039 

Ww* www 


54 945 


1 00 

I «ww 


9n 70 


ATOM 2475 


NE 


ARG 


1685 


90 609 

WW* www 


35 890 

ww.Www 


53 940 


1 on 

1 .WW 


91 no 


ATOM 2476 


CZ 


ARG 


1685 


91 532 

w • >WW*& 


36 808 

w w.wWw 


53 676 


1 00 

1 ivV 


99 84 


ATOM 2477 


NH1 


ARG 


1685 


91 552 

w 1 • Ww*w 


37 957 

w # .ww r 


54 348 


1 00 

1 .WW 


91 88 


ATOM 2478 


NH2 


ARG 


1685 


92 426 


36 581 

WW. WW 1 


59 799 


1 00 


91 n*? 


ATOM 2479 


c 


ARG 


1685 

• www^ 


86 180 

ww* 1 Uw 


35 460 


ci q^q 

W 1 .W*TW 


1 on 


m 81 


ATOM 2480 


0 


ARG 


1685 

9 WW 


86 606 

WW.WWW 


35 034 


CA070 

wU.*C/w 


1 on 


17 


ATOM 2481 


N 


ILE 


1686 


85 313 

WW* W 1 W 


36 484 


9 1 >4g4 


1 on 


1A fi9 


ATOM 2482 


CA 


ILE 


1686 

• Www 


84 761 

W*F. f w 1 


37 149 


CA OR7 


1 nn 


iq ia 

10.14 


ATOM 2483 

• l ■ >*r 1 Ww ¥ WW ; 


CB 


ILE 


1686 

■ www 


83 249 


38 7QQ 

WO* / 99 


CA 1AA 
OU. 1 Uw 




1A 01 


ATOM 2484 


CG2 


ILE 


1686 

1 www 


82 530 


O/ tOlU 


4Q 91 A 


1 nn 

l.UU 


q cn 

9.0U 


ATOM 2485 

#¥ • VfWl *W~WW 


CG1 


ILE 


1686 

• WWW 


S3 0Q8 

U0«U90 


Ow.OOU 




1 nn 

l.UU 


ie 

10.00 


ATOM 248B 

#» i w^ivi *w ¥ w w 


CD1 


ILE 


1686 

1 www 


O 1 *wOU 


34 05A 
04.900 


AQ oqo 


1 nn 

l.UU 


lo.lo 


ATOM 2487 

#» i v*rivi *w~w » 


c 

w 


ILE 


1686 

1 www 


84 QOQ 


op 

OO.Ow/ 


A79 


1 nn 
l.UU 


10 CO 


ATOM 2488 

#» 1 V./IVI *W^WW 


o 


ILE 

II nln 


1686 

I www 


Ad 814 


qO 1A9 
Oil. l^f*C 


Ci CAQ 


1 nn 
l.UU 


iii ne 
14.U5 


ATOM 2489 


N 


VAL 


16B7 




OQ OQO 


AO ACA 


1 nn 
l.UU 


4 c n* 
lb.Ul 


ATOM 2490 

*% | W/IVI ¥•» ¥ W W 


CA 

. w#» 


VAL 


1687 

1 VUr 


oc Me 


40 7A8 


ao cnft 


1 nn 
l.UU 


ic ie 
lo.lo 


ATOM 2491 


CB 

%**W 


VAL 


1687 

1 WW 0 


PR QQ7 


41 


AO r\Ko 


1 nn 
uuu 


lO.OD 


ATOM 2492 


CG1 

WW 1 


VAL 


1687 


87 9Q0 

Or 


40 707 


A7 A11 


1 nn 

1 «UU 


19 TO 


ATOM 2493 


CG2 


VAL 

W ¥*W 


1687 

1 WW § 


87 

Offal WW 




AO 91 A 


1 nn 
1 .UU 


ic n7 
10.U/ 


ATOM 2494 


c 

^¥* 


VAL 


1687 

1 VUI 




41 W 


Aft cao 


1 nn 

l.UU 


1 A 77 
lO./ / 


ATOM 2495 


o 

*¥* 


VAL 

¥/ ¥Mb 


1687 

1 WW 9 


84 558 


41 919 


A7 ^79 


1 nn 

l.UU 


1 A 7A 


ATOM 2496 


N 


LEU 


1688 

I WWW 


83 500 

WW* WWW 


49 017 


40 174 


1 nn 
1 *uu 


1 A *?7 
lO.O/ 


ATOM 2497 

m » • ^*¥IV. %» ¥ W * 


CA 


LEU 


1688 

• WWW 


82 394 

w*w*ww 1 1 


49 576 


4fl 411 


1 nn 
1 .uu 


1A 1A 

I O.IO 


ATOM 2498 

• » ¥ t ¥ ¥ ¥» ¥ W \m 


CB 


LEU 


1688 

1 WWW 


81 400 

W 1 •"Tww 


43 954 


AO q44 

H9,J*tH 


1 no 

1 .WW 


15 qn 

IO.OU 


ATOM 2499 


CG 

%*r%¥* 


LEU 


1688 

• www 


80 742 

ww* / *r*& 


49 398 


jU*009 


1 nn 


14 41 


ATOM 2500 


CD1 


LEU 


1688 

• WWW 


79 737 

r w* / W / 


43 199 

*tw« 1 *C*W 


51 17^ 


1 nn 

1 .WW 


19 7n 


ATOM 2501 


CD2 


LEU 


1688 


80 083 

WW* WWW 


41 142 


4Q 879 


1 no 


m no 


ATOM 2502 


c 


LEU 


1688 

■ WWW 


82.737 

wfct 9 W f 


43 502 


47 260 


1 00 




ATOM 2503 


0 


LEU 


1688 


82.047 


43 504 

~w* w w^ 


46 243 


1 00 

1 .WW 


23 25 

*W«Cw 


ATOM 2504 


N 


SER 


1689 


83.803 

WW 7 * WW\*¥ 


44 277 

• ^ **W ¥ # 


47 401 


1 00 


23 03 

•WW. WW 


ATOM 2505 


CA 


SER 


1689 


84.209 


45 213 

~w**» 1 w 


46 353 


1 00 

1 .WW 


21 96 

tm 1 .WW 


ATOM 2506 


CB 


SER 


1689 


84.999 


46 366 

~w.www 


46 968 


1 00 


21 45 


ATOM 2507 


OG 


SER 


1689 


86.151 


45 888 

^ W.WWW 


47 643 


1 00 

1 .WW 


24 53 


ATOM 2508 


c 


SER 


1689 


85.031 

. WW.WW 1 


44 593 

~ T»WWW 


45 938 


1 00 

1 .WW 


29 42 


ATOM 2509 


0 


SER 


1689 


85 306 

WW. Www 


45 259 

*tw.*WW*C 


44 9^Q 


1 on 

1 .uu 


95 59 


ATOM 2510 


N 


GLY 


1690 


85.430 


43 337 

ww. WW f 


45 409 


1 00 


21 46 

*w 1 i*ru 


ATOM 2511 


CA 


GLY 


1690 


86.242 


42.676 


44.406 


1.00 


2123 


ATOM 2512 


C 


GLY 


1690 


87.690 


43.130 


44.463 


1.00 


23.65 


ATOM 2513 


0 


GLY 


1690 


88.451 


42.922 


43.521 


1.00 


25.98 


ATOM 2514 


N 


LYS 


1691 


88.082 


43.738 


45.577 


1.00 


23.43 


ATOM 2515 


CA 


LYS 


1691 


89.445 


44.224 


45.746 


1.00 


23.50 


ATOM 2516 


CB 


LYS 


1691 


89.561 


44.980 


47.067 


1.00 


21.39 


ATOM 2517 


CG 


LYS 


1691 


90.907 


45.621 


47.286 


1.00 


21.65 


ATOM 2518 


CD 


LYS 


1691 


90.888 


46.513 


48.507 


1.00 


22.73 
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ATOM 2522 




LYS 


1691 

■ WW • 


90.440 

w w* ■ ■ \j 


42 133 


46424 


1 00 

• *ww 


26 46 

C*W*~W 


ATOM 9529 


N 


PRO 

i ny 


1692 


91 475 


43 249 


44 765 


1 00 


25 07 


ATOM 2S24 


CD 


PRO 


1692 


91 517 

v 1 #w 1 « 


44 246 


43 682 


1 00 


24 13 


ATOM 


OA 


PRO 


1692 


92 549 


42 270 


44 598 


1 00 


9& 54 


ATOM 


OB 


PRO 


1692 


92 933 


42 456 


43 143 


1 00 

1 •WW 


22 92 


ATOM 2527 


CG 


PRO 


1692 


92.815 


43.920 


42.986 


1.00 


23.88 


ATOM 2528 


C 


PRO 


1692 


93.716 


42.601 


45.516 


1.00 


27.60 


ATOM 2529 


0 


PRO 


1692 


93.810 


43.716 


46.029 


1.00 


29.48 


ATOM 2530 


N 


ALA 


1693 


94.570 


41.620 


45.770 


1.00 


31.04 


ATOM 2531 


CA 


ALA 


1693 


95.736 


41.833 


46.621 


1.00 


34.86 


ATOM 2532 


CB 


ALA 


1693 


96.112 


40.543 


47.326 


1.00 


3521 


ATOM 2533 


C 


ALA 


1693 


96.915 


42.354 


45.798 


1.00 


35.37 


ATOM 2534 


0 


ALA 


1693 


96.891 


42.145 


44.559 


1.00 


35.58 



tNS3COORD1NATES (ComplexB) 



Atom 



1 VPS 




£ 


v 

A 


v 


it, 


\J\JXJ 


D 

B 




ATOM 1 1 i A 




ddo 


1H9A 


7A ft9n 


AK eci 


40.0/0 


1 nn 


Oft £Q 


ATOM mo 

aiuivi my 


on 


rr\\J 


lU&O 


I O.U/O 




aa ecn 


4 nn 


Oft 


ATOM 1 1 on 
A I Um 1 1 £\J 


oa 


DDO 


IVJ£0 


7A flftQ 
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1142 


73.080 


61.695 


48.640 


1.00 


26.82 


ATOM 1967 


CG1 


VAL 


1142 


72.710 


63.160 


48.504 


1.00 


25.54 


ATOM 1968 


CG2 


VAL 


1142 


74.398 


61.554 


49.368 


1.00 


2422 


ATOM 1969 


C 


VAL 


1142 


70.650 


61.047 


48.640 


1.00 


30.42 


ATOM 1970 


O 


VAL 


1142 


70.465 


60.445 


47.575 


1.00 


31.72 


ATOM 1971 


N 


ARG 


1143 


69.708 


61.788 


49220 


1.00 


32.81 


ATOM 1972 


CA 


ARG 


1143 


68.411 


62.006 


48.580 


1.00 


33.55 


ATOM 1973 


CB 


ARG 


1143 


67.282 


62.037 


49.613 


1.00 


35.71 


ATOM 1974 


CG 


ARG 


1143 


65.903 


62.331 


49.017 


1.00 


37.69 


ATOM 1975 


CD 


ARG 


1143 


65.465 


61.288 


47.991 


1.00 


39.30 


ATOM 1976 


NE 


ARG 


1143 


64.225 


61.685 


47.311 


1.00 


4320 


ATOM 1977 


CZ 


ARG 


1143 


63.383 


60.843 


46.711 


1.00 


43.40 


ATOM 1978 


NH1 


ARG 


1143 


63.628 


59.539 


46.696 


1.00 


43.88 


ATOM 1979 


NH2 


ARG 


1143 


62.297 


61.307 


46.108 


1.00 


43.09 


ATOM 1980 


C 


ARG 


1143 


68.439 


63.320 


47.818 


1.00 


3327 


ATOM 1981 


O 


ARG 


1143 


68.794 


64.353 


48.382 


1.00 


33.50 


ATOM 1982 


N 


ARG 


1144 


68.044 


63280 


46.549 


1.00 


33.18 


ATOM 1983 


CA 


ARG 


1144 


68.038 


64.471 


45.709 


1.00 


3420 


ATOM 1984 


CB 


ARG 


1144 


67.725 


64.089 


44260 


1.00 


32.94 


ATOM 1985 


CG 


ARG 


1144 


68.475 


64.900 


43203 


1.00 


32.47 


ATOM 1986 


CD 


ARG 


1144 


67.952 


64.546 


41.815 


1.00 


32.92 


ATOM 1987 


NE 


ARG 


1144 


68.924 


64.732 


40.738 


1.00 


33.75 


ATOM 1988 


CZ 


ARG 


1144 


69.459 


65.898 


40.387 


1.00 


35.07 


ATOM 1989 


NH1 


ARG 


1144 


69.125 


67.007 


41.029 


1.00 


36.60 


ATOM 1990 


NH2 


ARG 


1144 


70.321 


65.959 


39.378 


1.00 


35.64 


ATOM 1991 


C 


ARG 


1144 


67.001 


65.467 


46227 


1.00 


35.48 


ATOM 1992 


O 


ARG 


1144 


65.825 


65.139 


46.374 


1.00 


36.95 


ATOM 1993 


N 


ARG 


1145 


67.449 


66.670 


46.546 


1.00 


36.40 


ATOM 1994 


CA 


ARG 


1145 


66.552 


67.700 


47.048 


1.00 


37.95 


ATOM 1995 


CB 


ARG 


1145 


67.025 


68213 


48.414 


1.00 


37.18 


ATOM 1996 


CG 


ARG 


1145 


66.975 


67.184 


49.514 


1.00 


36.94 


ATOM 1997 


CD 


ARG 


1145 


65.570 


66.667 


49.681 


1.00 


38.43 


ATOM 1998 


NE 


ARG 


1145 


65.447 


65.729 


50.791 


1.00 


41.86 


ATOM 1999 


CZ 


ARG 


1145 


64.341 


65.043 


51.062 


1.00 


43.83 


ATOM 2000 


NH1 


ARG 


1145 


63.263 


65.187 


50299 


1.00 


46.04 


ATOM 2001 


NH2 


ARG 


1145 


64.306 


64223 


52.102 


1.00 


45.68 


ATOM 2002 


C 


ARG 


1145 


66.507 


68.856 


46.063 


1.00 


39.67 


ATOM 2003 


O 


ARG 


1145 


66.247 


70.000 


46.449 


1.00 


4226 


ATOM 2004 


N 


GLY 


1146 


66.789 


68.565 


44.797 


1.00 


39.96 


ATOM 2005 


CA 


GLY 


1146 


66.779 


69.605 


43.788 


1.00 


40.13 


ATOM 2006 


C 


GLY 


1146 


67.932 


69.428 


42.823 


1.00 


40.68 


ATOM 2007 


O 


GLY 


1146 


68.829 


68.621 


43.061 


1.00 


41.79 


ATOM 2008 


N 


ALA 


1147 


67.937 


70.228 


41.765 


1.00 


40.55 


ATOM 2009 


CA 


ALA 


1147 


68.961 


70.157 


40.733 


1.00 


39.49 


ATOM 2010 


CB 


ALA 


1147 


68.890 


71.392 


39.834 


1.00 


40.92 


ATOM 2011 


C 


ALA 


1147 


70.392 


69.927 


41214 


1.00 


38.50 


ATOM 2012 


O 


ALA 


1147 


71.068 


69.024 


40.717 


1.00 


38.36 


ATOM 2013 


N 


SER 


1148 


70.854 


70.721 


42.176 


1.00 


37.84 
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ATOM 2017 


C 


SER 


1148 


72.391 


70.363 


44.156 


1.00 


36.04 


ATOM 2018 


O 


SER 


1148 


73.382 


70.812 


44.734 


1.00 


36.47 


ATOM 2019 


N 


ARG 


1149 


71.452 


69.672 


44.788 


1.00 


34.56 


ATOM 2020 


CA 


ARG 


1149 


71.575 


69.416 


46.214 


1.00 


33.98 


ATOM 2021 


CB 


ARG 


1149 


70.923 


70.518 


47.050 


1.00 


35.91 


ATOM 2022 


CG 


ARG 


1149 


71.621 


70.740 


48.397 


1.00 


40.83 


ATOM 2023 


CD 


ARG 


1149 


70.760 


71.514 


49.389 


1.00 


44.70 


ATOM 2024 


NE 


ARG 


1149 


71.544 


72.142 


50.456 


1.00 


48.33 


ATOM 2025 


cz 


ARG 


1149 


71.832 


73.444 


50.506 


1.00 


50.37 


ATOM 2026 


NH1 


ARG 


1149 


71.416 


74.266 


49.552 


1.00 


52.98 


ATOM 2027 


NH2 


ARG 


1149 


72.508 


73.941 


51.531 


1.00 


51 .44 


ATOM 2028 


C 


ARG 


1149 


70.981 


68.070 


46.573 


1.00 


32.53 


ATOM 2029 


O 


ARG 


1149 


70.168 


67.513 


45.833 


1.00 


32.88 


ATOM 2030 


N 


GLY 


1150 


71.418 


67.541 


47.705 


1.00 


31.86 


ATOM 2031 


CA 


GLY 


1150 


70.934 


66.260 


48.171 


1.00 


29.91 


ATOM 2032 


C 


GLY 


1150 


71.207 


66.138 


49.653 


1.00 


28.10 


ATOM 2033 


O 


GLY 


1150 


72.237 


66.603 


50.137 


1.00 


29.55 


ATOM 2034 


N 


SER 


1151 


70.289 


65.516 


50.376 


1.00 


27.99 


ATOM 2035 


CA 


SER 


1151 


70.439 


65.348 


51.811 


1.00 


28.25 


ATOM 2036 


CB 


SER 


1151 


69.072 


65.434 


52.487 


1.00 


31.79 


ATOM 2037 


OG 


SER 


1151 


68.433 


66.664 


52.201 


1.00 


39.70 


ATOM 2038 


C 


SER 


1151 


71.071 


64.020 


52.164 


1.00 


26.53 


ATOM 2039 


O 


SER 


1151 


70.653 


62.974 


51.662 


1.00 


25.17 


ATOM 2040 


N 


LEU 


1152 


72.089 


64.055 


53.013 


1.00 


25.87 


ATOM 2041 


CA 


LEU 


1152 


72.706 


62.814 


53.443 


1.00 


27.78 


ATOM 2042 


CB 


LEU 


1152 


73.948 


63.075 


54.296 


1.00 


26.21 


ATOM 2043 


CG 


LEU 


1152 


75.157 


63.729 


53.627 


1.00 


24.51 


ATOM 2044 


CD1 


LEU 


1152 


76.335 


63.659 


54.564 


1.00 


23.77 


ATOM 2045 


C02 


LEU 


1152 


75.496 


63.008 


52.343 


1.00 


25.98 


ATOM 2046 


C 


LEU 


1152 


71.629 


62.147 


54.287 


1.00 


29.68 


ATOM 2047 


O 


LEU 


1152 


70.965 


62.819 


55.075 


1.00 


31.97 


ATOM 2048 


N 


LEU 


1153 


71.385 


60.860 


54.065 


1.00 


31.23 


ATOM 2049 


CA 


LEU 


1153 


70.372 


60.159 


54.845 


1.00 


31.26 


ATOM 2050 


CB 


LEU 


1153 


69.991 


58.836 


54.181 


1.00 


29.73 


ATOM 2051 


CG 


LEU 


1153 


68.811 


58.932 


53.215 


1.00 


27.81 


ATOM 2052 


CD1 


LEU 


1153 


69.023 


60.036 


52.193 


1.00 


29.89 


ATOM 2053 


CD2 


LEU 


1153 


68.619 


57.614 


52.528 


1.00 


28.98 


ATOM 2054 


C 


LEU 


1153 


70.862 


59.933 


56.269 


1.00 


33.19 


ATOM 2055 


O 


LEU 


1153 


70.075 


59.670 


57.182 


1.00. 


34.12 


ATOM 2056 


N 


SER 


1154 


72.166 


60.072 


56.459 


1.00 


34.74 


ATOM 2057 


CA 


SER 


1154 


72.775 


59.899 


57.762 


1.00 


37.99 


ATOM 2058 


CB 


SER 


1154 


73.331 


58.483 


57.903 


1.00 


40.16 


ATOM 2059 


OG 


SER 


1154 


72.446 


57.531 


57.326 


1.00 


44.40 


ATOM 2060 


C 


SER 


1154 


73.910 


60.907 


57.871 


1.00 


38.37 


ATOM 2061 


O 


SER 


1154 


74.946 


60.743 


57.236 


1.00 


38.18 


ATOM 2062 


N 


PRO 


1155 


73.694 


62.005 


58.622 


1.00 


38.99 


ATOM 2063 


CD 


PRO 


1155 


72.505 


62.296 


59.437 


1.00 


39.19 


ATOM 2064 


CA 


PRO 


1155 


74.708 


63.045 


58.805 


1.00 


38.38 
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ATOM 2068 


O 


PRO 


1155 


76.121 


61.531 


60.044 


1.00 


36.59 


ATOM 2069 


N 


ARG 


1156 


77.104 


63.088 


58.732 


1.00 


36.29 


ATOM 2070 


CA 


ARG 


1156 


78.456 


62.654 


59.023 


1.00 


3523 


ATOM 2071 


CB 


ARG 


1156 


79.031 


61.956 


57.803 


1.00 


38.13 


ATOM 2072 


CG 


ARG 


1156 


78.185 


60.870 


57.211 


1.00 


41.38 


ATOM 2073 


CD 


ARG 


1156 


78.700 


59.509 


57.644 


1.00 


44.77 


ATOM 2074 


NE 


ARG 


1156 


78.123 


58.412 


56.868 


1.00 

■ *ww 


49.49 

~W."TW 


ATOM 2075 


CZ 


ARG 


1156 


78.459 


57.129 


57.005 


1.00 


5379 


ATOM 2076 


NH1 


ARG 


1156 


77.873 


56.205 


56.252 


1.00 

• .ww 


54.43 


ATOM 2077 


NH2 


ARG 


1156 


79.403 


56.763 


57.863 

%0 * • ww%^ 


1.00 

1 .ww 


52 82 


ATOM 2078 


C 


ARG 


1156 


79.340 


63.849 


59255 

ww.fcww 


1.00 

1 • w w 


34 55 

w~.w w 


ATOM 2079 


O 


ARG 


1156 


79.066 


64.939 


58.756 


1 00 

• • WW 


35 72 

WW*# fa 


ATOM 2080 


N 


PRO 


1157 


80.443 


63.673 


59.989 

ww.www 


1.00 

■ .WW 


34 18 

W~. 1 w 


ATOM 2081 


CD 


PRO 


1157 


80.899 


62.473 


60.707 


1 00 

1 iww 


33.27 


ATOM 2082 


CA 


PRO 


1157 


81.344 


64.798 


60233 


1.00 

1 .WW 


33 99 

WW.W W 


ATOM 2083 


CB 


PRO 


1157 


82.419 


64.167 


61.111 


1.00 

I .WW 


33.27 

WW .fa 9 


ATOM 2084 


CG 


PRO 


1157 


81.690 


63.074 


61 .803 

w • .www 


1.00 

■ .WW 


3225 

Wfa.faw 


ATOM 2085 


C 


PRO 


1157 


81.950 


65.234 


58.898 


1 00 

1 .WW 


35 59 

WW .WW 


ATOM 2086 


0 


PRO 


1157 


82.404 


64.393 


58.130 


1.00 

■ .WW 


3792 

wf »W fa 


ATOM 2087 


N 


ILE 


1158 


81.999 


66.527 


58.631 


1 00 


36 57 

ww»w • 


ATOM 2088 


CA 


ILE 


1158 


82.542 


67.000 


57.362 


1 00 

■ .WW 


38 10 

WW* 1 w 


ATOM 2089 


CB 


ILE 


1158 


82.647 


68.533 


57.315 


1 00 

1 .WW 


38 12 

WW. 1 fa 


ATOM 2090 


CG2 


ILE 


1158 


81.272 


69.140 


57 171 

W 9 t 1 1 I 


1 00 

1 .WW 


3734 

W# »W"T 


ATOM 2091 


CG1 


ILE 


1158 


83.342 


69.046 


58 573 


1 00 

1 .WW 


41 61 

"T 1 »W 1 


ATOM 2092 


CD1 


ILE 


1158 


83.623 


70.540 


58 559 

WW* WWW 


1 00 

1 • WW 


45 60 

*TW.WW 


ATOM 2093 


C 


ILE 


1158 


83.911 


66.408 


57.004 


1.00 


39.96 

WW* WW 


ATOM 2094 


O 


ILE 


1158 


84.280 


66.367 


55.828 


1 00 


41.17 

» 1 • 1 9 


ATOM 2095 


N 


SER 


1159 


84.642 


65.941 


58.014 


1.00 


40.39 

^w*ww 


ATOM 2096 


CA 


SER 


1159 


85.959 


65.341 


57.813 


1.00 


42.66 


ATOM 2097 


CB 


SER 


1159 


86.575 


64.975 


59.161 


1.00 


45.98 


ATOM 2098 


OG 


SER 


1159 


85.755 


64.035 


59.842 


1.00 


49.77 


ATOM 2099 


C 


SER 


1159 


85.870 


64.078 


56.955 


1.00 


42.30 


ATOM 2100 


O 


SER 


1159 


86.779 


63.761 


56.183 


1.00 


43.56 


ATOM 2101 


N 


TYR 


1160 


84.779 


63.346 


57.130 


1.00 


40.66 


ATOM 2102 


CA 


TYR 


1160 


84.531 


62.121 


56.385 


1.00 


40.36 


ATOM 2103 


CB 


TYR 


1160 


83.137 


61.598 


56.756 


1.00 


43.66 


ATOM 2104 


CG 


TYR 


1160 


82.848 


60.149 


56.423 


1.00 


47.75 


ATOM 2105 


CD1 


TYR 


1160 


82.086 


59.377 


57290 


1.00 


50.46 


ATOM 2106 


CE1 


TYR 


1160 


81.763 


58.059 


56.991 


1.00 


53.68 


ATOM 2107 


CD2 


TYR 


1160 


83293 


59.561 


55233 


1.00 


49.34 


ATOM 2108 


CE2 


TYR 


1160 


82.975 


58237 


54.922 


1.00 


51.68 


ATOM 2109 


CZ 


TYR 


1160 


82.205 


57.493 


55.809 


1.00 


53.15 


ATOM 2110 


OH 


TYR 


1160 


81.858 


56.185 


55.528 


1.00 


5420 


ATOM 2111 


C 


TYR 


1160 


84.563 


62.464 


54.901 


1.00 


37.91 


ATOM 2112 


O 


TYR 


1160 


85.111 


61.715 


54.080 


1.00 


38.62 


ATOM 2113 


N 


LEU 


1161 


84.020 


63.632 


54.578 


1.00 


3426 


ATOM 2114 


CA 


LEU 


1161 


83.932 


64.086 


53.203 


1.00 


30.68 


ATOM 2115 


CB 


LEU 


1161 


82.639 


64.889 


53.018 


1.00 


27.49 
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ATOM 2119 


C 


LEU 


1161 


85.148 


64.852 


52.675 


1.00 


30.40 


ATOM 2120 


O 


LEU 


1161 


85.242 


65.106 


51.473 


1.00 


31.18 


ATOM 2121 


N 


ALA 


1162 


86.094 


65.189 


53.545 


1.00 


29.45 


ATOM 2122 


CA 


ALA 


1162 


87.283 


65.919 


53.101 


1.00 


29.15 


ATOM 2123 


CB 


ALA 


1162 


88.222 


66.214 


54.281 


1.00 


29.64 


ATOM 2124 


C 


ALA 


1162 


88.024 


65.140 


52.015 


1.00 


27.83 


ATOM 2125 


O 


ALA 


1162 


88.278 


63.937 


52.150 


1.00 


28.97 


ATOM 2126 


N 


GLY 


1163 


88.324 


65.830 


50.922 


1.00 


26.09 


ATOM 2127 


CA 


GLY 


1163 


89.032 


65.216 


49.816 


1.00 


24.79 


ATOM 2128 


C 


GLY 


1163 


88.127 


64.490 


48.845 


1.00 


22.40 


ATOM 2129 


O 


GLY 


1163 


88.607 


63.799 


47.958 


1.00 


22.88 


ATOM 2130 


N 


SER 


1164 


86.819 


64.667 


48.978 


1.00 


23.08 


ATOM 2131 


CA 


SER 


1164 


85.887 


63.988 


48.091 


1.00 


23.47 


ATOM 2132 


CB 


SER 


1164 


84.765 


63.336 


48.893 


1.00 


22.44 


ATOM 2133 


OG 


SER 


1164 


85.308 


62.418 


49.833 


1.00 


24.67 


ATOM 2134 


C 


SER 


1164 


85.314 


64.853 


46.984 


1.00 


23.93 


ATOM 2135 


O 


SER 


1164 


84.470 


64.389 


46.229 


1.00 


25.83 


ATOM 2136 
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SER 


1165 


85.762 


66.102 


46.877 


1.00 


24.65 


ATOM 2137 


CA 


SER 


1165 


85.271 


66.981 


45.815 


1.00 


24.90 


ATOM 2138 


CB 


SER 


1165 


85.812 


68.401 


45.981 


1.00 


26.62 
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OG 


SER 


1165 


85.674 


68.850 


47.316 


1.00 


33.09 


ATOM 2140 


C 


SER 


1165 


85.756 


66.423 


44.487 


1.00 


22.54 


ATOM 2141 


O 


SER 


1165 


86.942 


66.152 


44.327 


1.00 


22.12 


ATOM 2142 


N 


GLY 


1166 


84.843 


66.286 


43.533 


1.00 
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ATOM 2143 
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GLY 


1166 


85.184 


65.741 


42.232 


1.00 
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ATOM 2144 


C 


GLY 


1166 


84.745 


64.288 


42.122 


1.00 


20.47 
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O 


GLY 


1166 


84.756 


63.720 


41.033 


1.00 


19.68 
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GLY 


1167 


84.388 


63.684 


43.258 


1.00 


20.15 


ATOM 2147 


CA 


GLY 


1167 


83.928 


62.303 


43.294 


1.00 


20.19 


ATOM 2148 


C 


GLY 


1167 


82.534 


62.182 


42.712 


1.00 


20.08 


ATOM 2149 


O 


GLY 


1167 


81.798 


63.161 


42.698 


1.00 


21.88 


ATOM 2150 


N 


PRO 


1168 


82.109 


60.986 


42.297 


1.00 


20.02 


ATOM 2151 


CO 


PRO 


1168 


82.901 


59.746 


42.208 


1.00 


20.29 


ATOM 2152 


CA 


PRO 


1168 


80.786 


60.792 


41.710 


1.00 


19.44 


ATOM 2153 


CB 


PRO 


1168 


81.002 


59.553 


40.851 


1.00 
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ATOM 2154 


CG 


PRO 


1168 


81.880 


58.729 


41.726 


1.00 


18.35 


ATOM 2155 


C 


PRO 


1168 


79.578 


60.595 


42.611 


1.00 


19.51 


ATOM 2156 


O 


PRO 


1168 


79.683 


60.068 


43.712 


1.00 


2028 


ATOM 2157 


N 


LEU 


1169 


78.427 


61.050 


42.123 


1.00 


19.50 


ATOM 2158 


CA 


LEU 


1169 
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